Acoustic - labial speaker verification 1
نویسندگان
چکیده
This paper describes a multimodal approach for speaker verification. The system consists of two classifiers, one using visual features, the other using acoustic features. A lip tracker is used to extract visual information from the speaking face which provides shape and intensity features. We describe an approach for normalizing and mapping different modalities onto a common confidence interval. We also describe a novel method for integrating the scores of multiple classifiers. Verification experiments are reported for the individual modalities and for the combined classifier. The integrated system outperformed each sub-system and reduced the false acceptance rate of the acoustic sub-system from 2.3% to 0.5%. q 1997 Elsevier Science B.V.
منابع مشابه
Acoustic-labial Speaker Verification Pattern Recognition Letters Acoustic-labial Speaker Verification
This paper describes a multimodal approach for speaker veri cation The system consists of two classi ers one using visual features and the other using acoustic features A lip tracker is used to extract visual information from the speaking face which provides shape and intensity features We describe an approach for normalizing and mapping di erent modalities onto a common con dence interval We a...
متن کاملAudiovisual speaker identity verification based on lip motion features
In this paper, we propose the fusion of audio and explicit lip motion features for speaker identity verification applications. Experimental results using GMM-based speaker models indicate that audiovisual fusion with explicit lip motion information provides significant performance improvement for verifying both the speaker identity and the liveness, due to tracking of the closely coupled acoust...
متن کاملAcoustic-labial Speaker Verification
This paper describes a multimodal approach for speaker ver-iication. The system consists of two classiers, one using visual features and the other using acoustic features. A lip tracker is used to extract visual information from the speaking face which provides shape and intensity features. We describe an approach for normalizing and mapping dierent modalities onto a common conndence interval. ...
متن کاملAcoustic-labial Speaker Verication 1 This Work Has Been Performed within the Framework of the M2vts (multi Modal
This paper describes a multimodal approach for speaker verication. The system consists of two classiers, one using visual features and the other using acoustic features. A lip tracker is used to extract visual information from the speaking face which provides shape and intensity features. We describe an approach for normalizing and mapping dierent modalities onto a common conndence interval. We...
متن کاملUsing Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems
Speaker verification from talking a few words of sentences has many applications. Many methods as DTW, HMM, VQ and MQ can be used for speaker verification. We applied MQ for its precise, reliable and robust performance with computational simplicity. We also used pitch frequency and log gain contour for further improvement of the system performance.
متن کامل